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ABSTRACT 

Ribonucleic acid (RNA)-binding proteins are key 
players of gene expression control. We have shown 
that Gemin5 interacts with internal ribosome en- 
try site (IRES) elements and modulates initiation 
of translation. However, little is known about the 
RNA-binding sites of this protein. Here we show 
that the C-terminal region of Gemin5 bears two non- 
canonical bipartite RNA-binding sites, encompass- 
ing amino acids 1297-1412 (RBS1) and 1383-1508 
(RBS2). While RBS1 exhibits greater affinity for RNA 
than RBS2, it does not affect IRES-dependent trans- 
lation in G5-depleted cells. In solution, the RBS1 
three-dimensional structure behaves as an ensem- 
ble of flexible conformations rather than having a 
defined tertiary structure. However, expression of 
the polypeptide G5 138 3-1508> bearing the low RNA- 
binding affinity RBS2, repressed IRES-dependent 
translation. A comparison of the RNA-binding capac- 
ity and translation control properties of constructs 
expressed in mammalian cells to that of the Gemin5 
proteolysis products observed in infected cells re- 
veals that non-repressive products accumulated dur- 
ing infection while the repressor polypeptide is not 
stable. Taken together, our results define the low 
affinity RNA-binding site as the minimal element of 
the protein being able to repress internal initiation of 
translation. 

INTRODUCTION 

Ribonucleic acid (RNA)-binding proteins play a pivotal 
role in the regulation of gene expression. Due to their ca- 
pacity to interact with different targets, either proteins or 
RNAs, this diverse group of proteins plays multifunctional 



roles in multiple RNA-dependent processes (1-4). In ear- 
lier studies, we identified Gemin5 as one of the proteins 
interacting directly with two different viral internal ribo- 
some entry site (IRES) elements, foot-and-mouth disease 
virus (FMDV) and hepatitis C virus (5). Subsequently, we 
found that it mediates translation repression (6). Gemin5 is 
a peripheral protein of the survival of motor neuron com- 
plex (7), a multiprotein complex found in metazoan cells 

(8) . This complex, which is composed of 7 Sm proteins 
and the Gemins 2-8, plays a critical role on the biogenesis 
of spliceosomal small nuclear ribonucleoproteins (snRNPs) 

(9) . In turn one of the Gemins, Gemin5, has been reported 
to be responsible for the interaction with the Sm site of snR- 
N As (10). Thereby, the finding that Gemin5 down-regulated 
translation (6) revealed a new role for this protein on trans- 
lation control beyond its role in snRNP biogenesis (11). 
However, little is known about how this protein recognizes 
distinct RNA targets. 

Initiation of translation in eukaryotic messenger RNAs 
(mRNAs) involves a set of initiation factors (elFs) that re- 
cruit the small ribosome subunit to the m 7 GTP residue (or 
cap) located at the 5'end of most mRNAs (12). In addi- 
tion, cap-independent mechanisms have been exploited by 
mRNAs translated under conditions that compromise the 
general cap-dependent translation, as it has been illustrated 
in virus infected cells (13-15). To evade cap-dependent in- 
hibition and, at the same time, benefit from the transla- 
tion shut down of the vast majority of cellular mRNAs, 
some viral RNAs have evolved a mechanism based on IRES 
elements. Interestingly, cellular mRNAs translated under 
strong cap-dependent inhibition also make use of IRES ele- 
ments to start translation (16). In the case of picornaviruses 
and other positive-strand viral RNAs, these elements are 
long, heavily structured regions that recruit the ribosomal 
subunits internally, promoting translation initiation at in- 
ternal start codons independent of the 5 'end of the mRNA 
(17-20). IRES-dependent translation of viral and cellular 
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mRNAs depends on a large variety of RNA-binding pro- 
teins, in addition to a subset of elFs (21-23). However, how 
distinct RNA-binding proteins regulate the activity of IRES 
elements remains to be elucidated. 

We have recently shown that Gemin5 recognizes the IRES 
element via its interaction with a stem loop placed at the 
3 'end (24). This was achieved by preparing immunoprecip- 
itated photocrosslinked complexes with soluble cell extracts 
or by using RNA-binding assays with recombinant pro- 
teins. To gain a higher resolution understanding of this in- 
teraction, selective 2'hydroxyl acylation analysed by primer 
extension RNA reactivity was performed with the FMDV 
IRES and purified proteins. This experiment demonstrated 
that incubation of Gemin5 with the IRES induced the spe- 
cific protection of residues within domain 5, out-competing 
the IRES stimulator protein PTB (24). This effect is consis- 
tent with its down-regulatory role in IRES-dependent trans- 
lation. 

Interestingly, Gemin5 is proteolyzed in FMDV infected 
cells by the action of the L protease (25). This feature is 
shared with other host factors involved in gene expres- 
sion control, such as eIF4G, eIF5B, poly(A)-binding pro- 
tein (PABP), polypyrimidine tract-binding protein (PTB), 
Gemin3 and cleavage stimulation factor, together with 
other proteins controlling various steps of RNA biology 
(26-31). 

The three-dimensional structure of full length Gemin5 is 
yet unknown. Notably, we have found that a polypeptide en- 
compassing the C-terminal region of the protein was able to 
interact directly with the IRES to a similar extent than the 
full-length protein (24). Furthermore, in experiments car- 
ried out in parallel the N -terminal region of the protein had 
no IRES-binding capacity, suggesting that the C-terminal 
region harbors all the necessary features to promote an in- 
teraction with the IRES element. Thus, the RNA-binding 
domain recognising the IRES element differs from the re- 
gion responsible for the interaction with snRNAs, which is 
located on the fifth tryptohan-aspartic motif (WD) repeat 
within the N-terminal region of Gemin5 (10). These results 
led us to investigate the possibility that different regions of 
the protein could be involved in the recognition of RNA tar- 
gets with different sequences, structural organisations and 
RNA functions. 

An in silico analysis of the C-terminal region of the 
protein did not reveal any apparent sequence similarities 
to other well-known RNA recognition motifs present in 
canonical RNA-binding proteins. This lack of similarity to 
other RNA recognition motifs prompted us to study the 
molecular basis of the RNA-binding properties of this frag- 
ment and also its capacity to repress translation. Here we 
show that the region of Gemin5 encompassing amino acids 
1383-1508 is coincident with a proteolysis product result- 
ing from viral L protease activity. In addition, it bears an 
RNA-binding site (RBS2). Expression of this polypeptide 
in G5-depleted cells represses IRES-dependent translation 
initiation. Moreover, the RNA-binding capacity of Gemin5 
is not confined to this region but it is further extended up- 
stream of the 1383-1508 motif. Indeed, an overexpressed 
protein, encompassing amino acids 1297-1412, exhibits the 
highest affinity for RNA in both gel-shift and UV-crosslink 
assays. This region (RBS1), however, does no affect transla- 



tion initiation in G5-depleted cells. Interestingly, the RBS1 
fragment in solution does not contain a unique tertiary 
structure. Indeed, the 'H-^N nuclear magnetic resonance 
(NMR) dispersion is poor in agreement with the presence 
of an ensemble of flexible conformations, which probably 
accounts for the functional versatility of the protein. 

MATERIALS AND METHODS 

Constructs 

Plasmids expressing domain 5 of the FMDV IRES el- 
ement (32), CMVpBIC (expressing the bicistronic RNA 
CAT FMDV IRES-luciferase) (6), pcDNA3-G5FLAG 
and pRSETBG5 were described (24). To express proteins in 
mammalian cells, we generated the vector pcDNA3/Xpress 
that allowed the expression of amino terminal HIS-Xpress 
tagged fusion proteins. For this, the Hindlll/EcoRI frag- 
ment from pcDNA3.1-HIS6CDaxx (25) was inserted into 
pcDNA3.1/ZEO (Invitrogen), similarly digested. 

pcDNA3/XpressG5 8 45_i508 was created in two steps. 
First, the polymerase chain reaction (PCR) product 
obtained with the pair of primers G5-ls/G5-10as (see 
Supplementary Table SI for primers) and template 
pcDNA3G5FLAG was inserted into pcDNA3/Xpress 
via BamHI-EcoRI. This construct was used to insert the 
PCR product resulting with primers G5-lls/G5-2as via 
EcoRl-Notl. Construct pcDNA3/XpressG5 8 45_i436 was 
similarly generated using primers G5-lls/G5-12as in the 
second PCR step. To create pcDNA3/XpressG5i287-i508, 
the PCR product obtained with primers G5-5s/G5- 
6as using as template pET-G5 1287-1 508 was inserted 
via BamHI-NotI into pcDNA3/Xpress. Construct 
pcDNA3/XpressG5i383_i508 was generated using primers 
G5-3s/G5^1as and inserted via KpnI-BamHI into 
pcDNA3 /Xpress. pcDNAXpress-G5 1297-1412 was gen- 
erated by PCR with primers G5-7s/G5-8as and pcD- 
NAG5Flag template, then inserted via KpnI-NotI into 
pcDNAXpress. 

To purify proteins expressed in bacteria, the construct 
pET-G5845_i436 (expressing His-G5845-i436) was generated in 
two steps. First, the PCR product obtained using primers 
G5-9s/G5-10as with pCDNA3G5Flag as template was 
inserted via EcoRI-Ndel into pET28a+. The later con- 
struct was used to insert via EcoRI-Sall the PCR prod- 
uct obtained with primers G5-lls/G5-12as on template 
pcDNA3/XpressG5845_i436- The construct pET-G5i287-i508 
expressing His-G5i 2 87-i508 was generated using primers 
G5-13s/G5-14as with pcDNA3G5Flag as template and 
inserted into the EcoRI-XhoI sites of pET28a+. Con- 
struct His-G5i383-i508 was generated by insertion of the 
EcoRI-Hindlll fragment of pcDNA3/XpressG5i383-i5os 
into pET28a+. Construct pRSETB-G5Ct A i365 i394 was 
generated by substitution of the Sph-Hindlll fragment by 
a PCR product obtained with the pair of primers G5- 
15s/G5-16as using as template pcDNA3G5-V5. 

The plasmid pYES2-G5845-iso8 to express Gemin5 in 
yeast was generated via PCR with primers G5-17s/G5- 
18as using as template pcDNA3.1-XpressG5845-i5os, then 
digested with Kpnl-Notl and inserted into pYES2. The nu- 
cleotide sequence of all constructs was verified by sequenc- 
ing (Macrogen). 
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RNA synthesis 

In vitro transcription was performed using T7 RNA 
polymerase with linearized DNA, 40 mM Tris-HCl, 50 
mM DTT, 0.5 mM rNTPs, as described (33). To ob- 
tain RNAs expressing G5i 2 i7-i508, G5 128 7-i508, G5i 2 i7-i436 
or G5i383-i508, the corresponding plasmids were linearized 
with Nhel, Sail or Hindlll. Bicistronic RNAs [cloram- 
phenicol acetyl transferase (CAT)-IRES-luciferase (LUC)] 
were produced as described (34). IRES transcripts were uni- 
formly labelled using a 32 P-CTP (500 Ci/mmol) (35). RNA 
was extracted with phenol-chloroform, ethanol precipitated 
and resuspended in TrislO mM, pH 8, EDTA ImM (TE) 
to a concentration of 0.04 pmol/jjil. RNA integrity was ex- 
amined in 6% acrylamide 7 M urea denaturing gel elec- 
trophoresis (36). 

Expression and purification of proteins 

Escherichia coli BL21 transformed with plasmids 

pET-G5g45_i436, pET-G5i287-1508, pET-G5 1383-1508, pET- 

G5 A i365-D94 grown at 37°C were induced with IsopropylpS- 
D-l-thiogalactopyranoside (IPTG) 0.5 M during 2 h. 
Bacterial cell lysates were prepared in binding buffer 
(20 mM NaH 2 P0 4 , 500 mM NaCl, 20 mM Imidazole) 
using a French press, and cell debris was eliminated by 
centrifugation at 16000 g 30 min at 4°C twice. The lysate 
was loaded in His-GraviTrap columns (HealthCare) and 
the recombinant protein was eluted using Imidazole 500 
mM. Proteins were dialysed against phosphate buffer pH 
6.8, 1 mM DTT, and stored at -20°C in 50% glycerol. 

Saccharomyces cerevisiae W303, grown in rich yeas- 
textract peptone dextrose (YEPD) medium, was trans- 
formed with plasmid pYES2-G5 8 45-i5o 8 using lithium ac- 
etate. Transformants were grown in syntheticcomplete (SC) 
selection medium lacking uracil. Yeast S30 fraction was pre- 
pared as described (37) and subsequently used to purify 
G5g45-i508 in His-GraviTrap columns performing extensive 
washings with 20 mM Imidazole buffer prior to protein elu- 
tion. 

RNA-protein photocrosslinking 

Uniformly radiolabeled RNA (IRES domain 5, 0.04 pmol, 
~5 x 10 5 cpm) was incubated with purified proteins (50-200 
ng) and UV-irradiated as described (6). Following exten- 
sive RNase A treatment samples were subjected to 10-15% 
sodium dodecylsulphate-polyacrylamide gel electrophore- 
sis (SDS-PAGE) and 32 P-labelled proteins were visualized 
by autoradiography (38). 

RNA-protein binding assay 

Ni-NTA/HIS-Gemin5 complexes were assembled by incu- 
bating Ni-NTA agarose resin (25 yA) (Qiagen), pre -washed 
four times with cold BBH(binding buffer Hepes) buffer (10 
mM HEPES; pH 7.4, 100 mM NaCl, 2.5 mM MgCl 2 , 0.01% 
NP-40), with the polypeptide of interest (0.7 pmol) in 100 
julI of BBH buffer during 4 h at 4° C in a rotating wheel. 
Unbound proteins were removed by washing three times 
with cold BBH buffer. 32 P-labelled domain 5 RNA (80 fmol) 
was added to each protein-beads complex and incubated 



in cold BBH buffer in the presence of cytoplasmic RNA 
(300-fold excess), during 2 h at 4°C in a rotating wheel. 
Unbound RNA was removed by washing five times with 
cold BBH buffer. Bound RNAs were extracted with phenol- 
chloroform, ethanol precipitated, and fractionated by 6% 
acrylamide 7 M urea denaturing gel electrophoresis, and vi- 
sualized by autoradiography. 

RNA electrophoretic mobility shift assay 

RNA-binding reactions were carried out for 20 min at 4°C 
in 10 julI of binding buffer [40 mM Tris (pH 7.5), 250 mM 
NaCl, 0.1% (w/v) (3ME]. Increasing amounts of protein 
were incubated with a fixed concentration of 32 P-labelled 
RNA (~2 nM). Electrophoresis was performed in non- 
denaturing 8.0% (29:1) polyacrylamide gels. The gels were 
run for 4 h in IX TBE (Tris-borate-EDTA) buffer (90 mM 
Tris, 64.6 mM boric acid, 2.5 mM EDTA, pH 8.4) at 100 V 
at 4°C. The gels were dried and exposed to a Phosphorlm- 
ager screen overnight. The screen was scanned on a Molec- 
ular Dynamics Storm 840 Phosphorlmager. 

Immunodetection 

Gemin5 was immunodetected by western blot (WB) using 
anti-Gemin5 (Novus) or anti-Xpress (Invitrogen) antibod- 
ies. Immunodetection of tubulin (Sigma) was used as load- 
ing control. Secondary antibodies (Thermo Scientific) were 
used according to the manufacturer instructions. 

Gemin5 siRNA interference and Gemin5 polypeptides ex- 
pression in mammalian cells 

siRNAs targeting Gemin5 mRNA (siG5.1 GCAUAGUG- 
GUGAUAAUUGAUU, siG5.2 CAAUGAAGAUG- 
GAUCAAUAUU, siG5.3 CCUUAAUCAAGAAGA- 
GAAAUU) or a control sequence (control siRNA 
AUGUAUUGGCCUGUAUUAGUU) were purchased 
from Dharmacon. HEK293 cells grown to 70% confluent 
were treated with 100 nM siRNA using lipofectamine 2000 
(Invitrogen) according to the manufacturer instructions, 
30 h prior to transfection with plasmids expressing the 
desired G5 polypeptides with, or without, of CMVpBIC 
construct, using lipofectamine (Invitrogen). The total 
amount of DNA in the transfection mixture was balanced 
with empty vector, pcDNA3.1. Cell lysates were prepared 
24, 48 or 56 h later in 100 (jul of lysis buffer (50 mM 
Tris-HCl pH 7.8, 100 mM NaCl, 0.5% NP40) and the 
protein concentration in the lysate was determined by 
Bradford assay. LUC and CAT activity was determined 
in each lysate. Values represent the mean ± SD of three 
independent assays. Equal amounts of protein were loaded 
in SDS-PAGE to determine the efficiency of interference 
as well as the efficiency of translation of the transfected 
Gemin5 polypeptides. 

IRES activity assays 

IRES activity was quantified as the expression of LUC 
normalized to the amount of protein in the lysate, while 
CAT activity reported cap-dependent translation from bi- 
cistronic mRNAs in transfected HEK293 cell monolayers 
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(6). Each experiment was carried out in duplicate wells and 
repeated independently at least three times. Values represent 
the mean ± SD. Differences in IRES-dependent transla- 
tion (LUC activity) induced by the coexpression of Gemin5 
polypeptides were analysed by a paired two-sided Student t- 
test; differences were considered significant when P < 0.01. 

NMR spectroscopy 

All experiments were recorded on a Bruker Avance III 
600-MHz spectrometer equipped with a z pulse field gra- 
dient unit and either a triple ('H, 13 C, 15 N) or quadru- 
ple (plus 31 P) resonance probe heads. Double labelled 
( 15 N and 13 C) samples were prepared to obtain sequence- 
specific assignments using HNCACB/HN(CO)CACB or 
CBCA(CO)NH/CBCANH experiments. The samples were 
prepared in 50 mM Phosphate buffer with 150 mM 
NaCl, at pH 6.7 All spectra were processed with the 
NMRPipe/NMRDraw (39) software and were analysed 
with Computer Aided Resonance Assignment software 
(CARA) (40). 

NMR titration experiments with RNA fragments 

For the 15 N-HSQC experiments, 15 N-labelled protein do- 
mains were prepared at 0.25 mM concentration, in the same 
buffer as described above, and unlabelled ligand was added 
to the sample up to a final molar ratio 3:1. Measurements 
were performed at 285 K or 295 K. 

RESULTS 

Direct interaction of Gemin5 with the IRES element involves 
two sites within the C-terminal region 

We have shown that Gemin5 has the ability to bind directly 
to domain 5 of the FMDV IRES element (24). Therefore, to 
identify the specific region of Gemin5 involved in the inter- 
action with the IRES element, we conducted UV-crosslink 
assays using increasing amounts of various overexpressed 
polypeptide fragments corresponding to the C-terminal re- 
gion of Gemin5 (Figure 1A) using radiolabelled domain 5 
as probe. The N-terminal region (residues 1-1288) of the 
protein, encompassing the WD repeats, was used as a con- 
trol. Incubation of polypeptide G5845-1508 with 32 P-labelled 
domain 5 followed of UV irradiation resulted in the detec- 
tion of a 85 kDa product that migrated with a polypep- 
tide of similar size detected by WB (Figure IB). Similarly, 
a band of about 75 kDa was observed using the G5845-1436 
purified polypeptide in UV-crosslinking coincident with the 
protein observed by WB. An asterisk in this panel depicts a 
non-specific product. No detectable UV-crosslinking prod- 
uct was observed with G5i_i288- 

Since the C-terminal region G5i287-i508 was shown to in- 
teract directly with IRES RNA (24), we divided this region 
in two products aimed to map precisely the RNA-binding 
domain of Gemin5. Interestingly, a prominent interaction 
was observed with purified G51297-1412, which had the same 
mobility as the peptide in a Coomassie Blue stained gel 
(Figure IB). Similarly, and although more weakly, a band 
of about 22 kDa was observed in UV-crosslinking carried 
out with purified G5i383-iso8, which was also coincident 



with the protein observed by WB (Figure IB). Therefore, 
we conclude that a bipartite RNA-binding site, comprising 
residues 1297-1412 (RBS1) and 1383-1508 (RBS2), within 
the C-terminal region of Gemin5 enables a direct interac- 
tion with the IRES element with RBS1 being more intense 
than RBS2 in UV-crosslinking experiments. 

To further support these findings, we used overexpressed 
proteins in RNA-pull down assays. Three constructs includ- 
ing an N-terminal HIS-tag to facilitate the protein purifi- 
cation were prepared, full-length Gemin5 (amino acids 1- 
1508), and the two halves, G5i 2 87-i5os and G5n 2 88 (Figure 
1C). Then, 32 P-labelled domain 5 was incubated with the 
HIS-tagged purified proteins immobilized on pre-washed 
Ni-agarose beads. Following extensive washing, bound 
RNAs were isolated and analysed by denaturing urea poly- 
acrylamide gel electrophoresis. The mobility of the isolated 
RNAs was similar to that of the free probe (input RNA, 
lane 1 ). A negative control containing RNA incubated with 
Ni-agarose beads free from protein was used as a measure of 
the background signal (lane 2). The RNA-binding capacity 
of the C-terminal region (G5i 2 87-i508, lane 5) was similar to 
that of the full-length protein (G5i_i5os, lane 3) and above 
the signal observed with the N-terminal fragment G5i_ 12 88 
(lane 4). Notably, polypeptides G5i 3 83_i508 and G51297-1412 
(Figure 1C, lanes 8 and 10, respectively) exhibited a positive 
RNA-binding capacity relative to the background signal 
(lane 6), which was also observed with the longer overlap- 
ping polypeptides G5 8 45-i508 and G5 8 45-i436 (lanes 7 and 9). 
This result was in full agreement with data obtained in UV- 
crosslinking assays, reinforcing our conclusion that a bipar- 
tite site spanning amino acids 1297-1412 (RBS1) and 1383- 
1508 (RBS2) within the most C-terminal region of Gemin5 
is necessary and sufficient to interact with the FMDV IRES. 

N-terminal deletions of the RNA-binding site 1 severely de- 
crease binding capacity 

The RNA-binding capacity of G5 1297-1412 observed in 
photo-crosslinking assays (Figure IB) prompted us to de- 
fine in further detail the boundaries of RBS1 using gel- 
shift assays with radiolabelled domain 5 and purified HIS- 
tagged proteins. As shown in Figure 2A, the polypeptide 
G5 1297-1412 exhibited the highest RNA-binding capacity, a 
Kd of approximately 20 |xM, since two constructs harbour- 
ing N-terminal deletions of 10 and 43 residues (G51307-1412, 
G5 1340-1412, respectively) showed a dramatic decrease in 
RNA binding, a Kd in excess of 350 |jlM. These results 
suggest that the entire RBS1 (blue shaded region in Fig- 
ure 2B), which includes two serine residues at its coil-coiled 
N-terminus, is important for RNA binding. On the other 
hand, according to secondary prediction models of the 
Gemin5 C-terminal region, the RBS2 (brown shaded region 
in Figure 2B) comprises four helices, of which the second 
(H2) and third (H3) ones are enriched in leucine and glu- 
tamine residues, respectively. 

Next, we attempted to generate a structural model of 
polypeptide 1297-1412 which showed the highest RNA- 
binding capacity. All experimental conditions we assayed 
yielded samples that provide NMR data in agreement with 
the absence of tertiary structure (Supplementary Figure 
SI). Only ~80% of the signals corresponding to the RBS1 
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Figure 1. Identification of the IRES-binding site in Gemin5. (A) Schematic of HIS-tagged polypeptides used in RNA-binding assays. Numbers indicate 
amino acid residues referred to the full-length protein. Grey ovals depict WD motifs located within the N-terminal region of the protein. A triangle within 
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Urea loaded with RNAs isolated from Ni-agarose beads coupled to the indicated proteins (0.07 pmol). D5 and RNAc are used for radiolabelled domain 
5 and total cytoplasmic RNA, respectively. 



domain were denned in the spectrum, precluding the unam- 
biguous assignment of all resonances. Still, we have assigned 
some fragments of the sequence using standard backbone 
triple resonance experiments. Based on these assignments 
and on the dispersion of the NMR signals shown in the 
spectrum, we conclude that the RBS1 construct has a short 
helical conformation surrounded by unstructured regions, 
yielding the ensemble of flexible conformations observed 
in 2D and 3D NOESY (Nuclear Overhauser effect spec- 
troscopy) experiments. 

Expression of truncated Gemin5 proteins in mammalian cells 
reveals differences in protein accumulation 

In order to assign the translational regulatory capacity to 
specific regions of the Gemin5 protein, we generated a set 



of constructs that allowed the expression of partially over- 
lapping polypeptides in mammalian cells (Figure 3A). Im- 
munodetection with anti-Gemin5 or anti-Xpress antibodies 
monitored polypeptide expression levels at different post- 
transfection times. Protein loading levels were verified us- 
ing anti-tubulin. The results obtained indicated that each 
of these proteins was differentially accumulated in the cell, 
requiring different expression times to detect an equivalent 
amount of each polypeptide. For instance, accumulation 
of G5845-1508 and G5s45_i436 in HEK293 cells was very ef- 
ficient at 24 h post-transfection time (hpt), remaining sta- 
ble up to 48 hpt (Figure 3B). Under the same conditions, 
expression of G5i287-i508 was efficient up to 30 hpt, declin- 
ing by 48-56 hpt. Surprisingly, when G5i 2 87-i508 was split 
into two fragments, the G5i383-iso8 showed a clear detec- 
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Figure 2. Determination of the RNA-binding capacity of Gemin5 truncated polypeptides. (A) Gel-shift assay conducted with increasing amounts of puri- 
fied His-tagged Gemin5 polypeptides depicted at the top and radiolabeled domain 5 RNA, fractionated in native gels and visualized by autoradiography. 
(B) Gemin5 C-terminal region structure prediction. Jpred3 program (Jnet version: 2.2, www.compbio.dundee.ac.uk) was used for the secondary structure 
prediction analysis. Helical prediction scores below 7 are shown in light grey and scores between 8 and 9 in dark grey. High scores represent higher prob- 
ability of adopting a helical conformation. Coil-coiled regions are displayed as a grey thick line and labelled in white Cs. Blue and brown boxes represent 
the boundaries of the two regions analyzed; dark brown box depicts the overlapping sequence. 



tion only 48-56 hpt (Figure 3B) while G5 1297-1412 exhibited 
a uniform expression over a period from 24 to 56 hpt (Fig- 
ure 3C). Accordingly, the ratio of the ectopically expressed 
protein to the endogenous Gemin5 (immunodetected with 
the same antibody in the same sample) was about 2-fold for 
G5 8 45-i508, G5 8 45-i436 and G5 1287-1508 24 hpt. In contrast, 
the G5 1297-1412 ratio reached maximum values (around 1) 
30 hpt while G5i383-i5os needed 48 hpt to detect similar ex- 
pression levels (Figure 3D). Therefore, the post-transfection 
time necessary to accumulate similar levels of ectopically 
expressed protein varies in accordance with the region of 
Gemin5 expressed in mammalian cells. 



Expression of Gemin5 polypeptides reveals that RNA-binding 
site 2 acts as a translation repressor 

With the aim to determine the influence of Gemin5 
polypeptides on translation control, we designed siRNAs 
to deplete Gemin5 in HEK293 cells (Supplementary Fig- 
ure S2), of which siG5.3 was the most efficient 48 hpt, re- 
maining active up to 72 hpt. In comparison to a control 
siRNA, the amount of Gemin5 protein detected by WB was 
reduced by 90% with siG5.3 (Figure 4A). To monitor the ef- 
fect of Gemin5 depletion on translational efficiency, we used 
the construct CMVpBIC that allows the synthesis of a sin- 
gle bicistronic transcriptional unit (Figure 4B) in which the 
expression of LUC is monitored to assay IRES-dependent 
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Figure 3. (A) Schematic of Gemin5 polypeptides expressed in transfected cells. Numbers indicate amino acid residues referred to in the text. Ovals depict 
WD motifs located within the N-terminal region of the protein. A triangle within the C-terminal region of the protein depicts the position of the epitope 
recognized by Gemin5 antibody. (B) Analysis of the optimal time of expression of Gemin5 polypeptides in HEK293 cells. Plasmids expressing the indicated 
proteins were transfected in HEK293 cells; transfected (+) and mock-transfected (— ) cells were harvested at the indicated time (h) in lysis buffer. Equal 
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antibody. Arrows depict the position of the full-length Gemin5 (G5) as well as the expressed proteins. Asterisks depict unspecific products. (C) Expression 
of G51297-1412 determined by anti-Xpress WB; loading control was assessed by anti-tubulin. (D) Ratio of the level of expression of Gemin5 polypeptides 
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translation while the levels of CAT are monitored to as- 
say 5 'end-dependent translation initiation. The efficiency of 
LUC expression in Gemin5-depleted cells normalized to the 
activity observed in cells transfected with a control siRNA 
indicated that depletion of Gemin5 induced an increase of 
IRES-dependent translation reported by LUC activity, as 
indicated by the P values (P < 0.005) obtained in the t-test 
(Figure 4B). This result is in agreement with previous data 



using shRNAs targeting a different sequence of Gemin5 
mRNA (6). 

Expression of Gemin5 polypeptides in G5-depleted cells 
was used to assess the effect of different C-terminal regions 
of the protein in translation control (Figure 5A). The effi- 
ciency of depletion was monitored by immunodetection on 
the same membrane as that used to monitor the expression 
of the desired Gemin5 polypeptide (Figure 5B-D) using 
Gemin5 antibody, with the exception of G5i 2 97_i4i2 that was 
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cap IRES 




Figure 4. Effect of Gemin5 depletion on protein synthesis. (A) WB anal- 
ysis of HEK293 cell extracts transfected with siRNA G5.3 targeted to 
Gemin5 or a control siRNA with no target sequence in mammalian mR- 
NAs. Tubulin was used as loading control. (B) Gemin5-depleted cells were 
used to monitor IRES- and cap-dependent translation upon transfection 
with equal amounts of the bicistronic plasmid (diagram at the top). Each 
experiment was repeated at least three times in duplicate wells. Effect on 
protein synthesis was calculated as the% of CAT (striped bars) and LUC 
(black bars) values observed in the control siRNA. Values represent the 
mean ± SD. (***P < 0.005). 



detected with anti-Xpress (Figure 5C). Loading control was 
verified by anti-tubulin immunoblot (Figure 5B-D). The ef- 
fect on translation efficiency was measured by the expres- 
sion of CAT (cap-dependent) and LUC (IRES-dependent) 
from the co-transfected plasmid CMVpBIC. In all cases, 
IRES-dependent translation was normalized to the levels of 
LUC observed in cells treated with the control siRNA (set 
to 100%) (empty bars in Figure 5B-D). Gemin5 depletion 
induced an increase of about 1.5-fold in IRES-dependent 
translation initiation in every experimental setting (black 
bars in Figure 5B-D), in agreement with earlier data (6). Ex- 
pression of G5 1287-1 508 affected IRES-dependent translation 
in a dose-dependent manner behaving as a stimulator of 
IRES-dependent translation (P < 0.005) (Figure 5B). How- 
ever, expression of G5i297-i4i2, corresponding to RBS1, did 
not alter the levels of IRES-dependent expression (Figure 
5C) despite similar levels of the truncated Gemin5 polypep- 
tide being detected. Therefore, none of these polypeptides 
mimic the repressor effect observed with the full-length pro- 
tein (Figure 4A). 

Next, using the same experimental procedure, we ob- 
served that the most C-terminal region of the protein con- 
taining RBS2, G5d83-i508, induced a dose-dependent in- 
hibition of IRES-dependent translation, reaching levels 
similar to those observed in the control siRNA-depleted 
cells (Figure 5D). In this case, longer post-transfection 
times were required to detect a similarly intense polypep- 
tide, consistent with expression data shown in Figure 3B. 
No significant changes in cap-dependent translation mon- 
itored by CAT expression were observed by expression of 



these polypeptides (Supplementary Figure S3). The IRES- 
dependent inhibitory capacity of G5i 3 83_i508 was also ob- 
served in in vitro translation assays when its mRNA was ex- 
pressed immediately prior to bicistronic RNA (Supplemen- 
tary Figure S4). 

Using a similar experimental procedure (Figure 6 A), we 
analysed the effect of proteins G5845_i508 and G5845-1436, ad- 
justing the expression time of each protein to the optimal 
conditions shown in Figure 3B. Analysis of these proteins 
was of interest due to the fact that these polypeptides cor- 
respond to Gemin5 proteolysis fragments detected in in- 
fected cells, p85 and p57, respectively (25). While the p85 
product appeared as a weak band in infected cells, p57 was 
accumulated during infection. Expression of G5845_i508 in 
G5-depleted cells showed a stimulation of IRES-dependent 
translation (P < 0.005) (Figure 6B), consistent with the 
G5 1287-1508 data (Figure 5B). In contrast, expression of 
G5845-1436, lacking the most C-terminal region of G5 (Fig- 
ure 2B), had no effect on internal initiation (Figure 6C). 

Collectively, these results led us to conclude that the 
IRES-dependent translational down-regulation of Gemin5 
observed in tissue culture cells specifically resides in RBS2, 
within amino acids 1383-1508 corresponding to the most 
C-terminal region of the protein. 

Relationship between the capacity to control translation initi- 
ation in mammalian cells and the proteolysis products found 
in infected cells 

As mentioned above, Gemin5 is proteolyzed in FMDV in- 
fected cells by the action of the L protease, rendering at 
least two products p85 and p57 (25). The recognition motif 
for the L protease rendering p85 included a stretch of basic 
amino acids (arginine-lysine-alanine-arginine) RKAR (po- 
sition 845) (Figure 7A), partially coincident with the cleav- 
age site within the viral polyprotein (L-VP4) and Daxx. 
The aforementioned data suggested that a cleavage site near 
the C-terminal end could be responsible for the p57 prod- 
uct. However, no products <20 kDa were detected by WB 
with the available antibodies (either anti-Gemin5 or anti- 
FLAG using FLAG-tagged overexpressed proteins) in in- 
fected cells. In addition, the protease recognition motif was 
difficult to predict due to the large diversity between L pro 
cleavage sites in host factors eIF4GI or eIF4GII (27,41). 
Furthermore, whether or not these products play a role on 
translation control remained to be elucidated. 

To gain information about the L protease cleavage site 
near the C-terminal end of Gemin5, we used the informa- 
tion available for Gemin5 p85, L-VP4 and Daxx (Figure 
7A) searching for similarities in amino acid sequence. The 
motif (threonine-lysine-arginine-leucine) TKRL located in 
G5 1383-1508 downstream of the epitope recognized by the 
Gemin5 antibody was a promising candidate. To determine 
whether this sequence provided an authentic L pro recogni- 
tion motif, we used the transfection assay successfully used 
to identify the RKAR motif (25). Mutational analysis of 
the truncated FLAG-tagged Gemin5 sequence (p40) indi- 
cated that substitution of basic amino acids by glutamic or 
proline (KR (lysine-arginine) to EE (glutamic-glutamic) or 
PP (proline-proline)) fully abrogated the cleavage by L pro- 
tease in transfected cells (Figure 7B). This result indicated 
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that cleavage around the TKRL motif of Gemin5 generates 
a small peptide (p25), corresponding to the most C-terminal 
fragment of the protein. 

The identified L pro cleavage products (Figure 7) and (25) 
are closely coincident with the G5 polypeptides G5g45-i508, 
G584 5 _i436, G5b83_i508, analysed in Figures 5 and 6. There- 
fore, given their differential capacity to mediate IRES- 
dependent translation in transfected cells it is tempting to 



suggest that the proteolysis products accumulated in in- 
fected cells may exert similar effects on translation initiation 
than the Gemin5 polypeptides analysed in this study. 

DISCUSSION 

The focus of this work was to decipher the critical regions 
of Gemin5 responsible for both an interaction with the 
IRES element and to mediate translation regulation. Here 
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we show that two minimal regions of Gemin5 encompass- 
ing amino acids 1297-1412 (RBS1) and 1383-1508 (RBS2) 
possess RNA-binding capacity, being responsible for the in- 
teraction with the 3 'end domain of a picornavirus IRES el- 
ement. By expressing various fragments of the protein in 
Gemin5-depleted cells, we show that the polypeptide 1383- 
1508 specifically down-regulates IRES-dependent transla- 
tion in mammalian cells. In addition, this polypeptide re- 
tains the capacity to interact directly with the IRES RNA 
in vitro in the absence of other factors. 

By using UV-crosslinking, RNA-binding pull down and 
gel-shift assays, we show that two short regions of Gemin5, 
RBS1 and RBS2, are sufficient to bind directly to do- 
main 5 of the FMDV IRES, with RBS1 having a higher 
affinity than RBS2. Longer polypeptides including these 
RNA-binding sites also interact with the IRES. Further- 
more, deletion mutant G5 A B65 i394 removing a long region 
of RBS1 (see Figure 2B) retains RNA-binding capacity in 
UV-crosslinking (Supplementary Figure S5), demonstrat- 
ing that the C-terminal region of RBS1 is dispensable for 
RNA binding. In contrast, no RNA-binding capacity above 
the background signal was found for an N-terminal frag- 
ment corresponding to two thirds of the protein. 

No apparent similarity has been detected between the 
Gemin5 C-terminal region and known RNA-binding mo- 
tifs. Therefore, the RNA-binding capacity of the C-terminal 



region of Gemin5 was unpredictable using bioinformatics' 
approaches. To get some insights into the molecular basis 
of this interaction, we analysed by NMR the polypeptide 
carrying RBS1 that showed the highest RNA-binding ca- 
pacity. However, only ~80% of the signals were defined in 
the spectrum (Supplementary Figure SI), precluding the as- 
signment of all resonances. Based on the assignments of 
some fragments of the sequence using standard backbone 
triple resonance experiments and on the dispersion of the 
NMR signals shown in the spectrum, we conclude that the 
RBS1 region has a short helical conformation surrounded 
by unstructured regions. The presence of these elements of 
secondary structure is in agreement with the obtained sec- 
ondary structure prediction (Figure 2B). 

Intrinsically unstructured proteins (IUPs) are generally 
flexible in order to perform their functions while others 
fold only in complex with target structures (42). IUPs play 
roles in cell signalling and cell cycle regulation, DNA recog- 
nition molecules, protein-RN A recognition, among others 
(43-45). Compared to sequences of ordered proteins, disor- 
dered protein sequences are depleted in I, L, V, W, F, Y and 
C, and enriched in E, K, R, G, Q, S, P and A. As a con- 
sequence of the low representation of hydrophobic amino 
acids in a given protein sequence, the abundant hydropho- 
bic interactions present in folded structures are nearly ab- 
sent and display physicochemical characteristics resembling 
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Figure 7. Mutational analysis of L protease cleavage site on Gemin5. (A) 
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those of random coils. When analysed by NMR, disordered 
samples display poor signal dispersion, as we observed in 
the NMR data acquired with the Gemin5 construct stud- 
ied. This flexibility probably represents a great advantage 
for a scaffold protein since the protein can easily select the 
best conformation among the ensemble to recognize many 
ligands. However, in our specific case it hampers the identi- 
fication of the residues involved in RNA recognition, since 
both bound and free protein chemical shifts are in fast ex- 
change. 

According to secondary prediction models of the Gemin5 
C-terminal region, RBS2 comprises four helices (Figure 
2B). Interestingly, the sequence of the second one (H2) is en- 
riched in leucine (LECCLVLLLI) while the third predicted 
helix (H3) contains a glutamine-rich (QEMQQQAQELLQ) 
sequence. Despite none of these sequences are canon- 
ical RNA-binding motifs, leucine-rich and particularly, 
glutamine-rich motifs are known to mediate the interaction 
of proteins with RNA, as reported in the case of LRPRPC 
protein (46) and Tia 1 (47), giving support to the RNA- 
binding capacity of the G5i383-i5os shown in this study. 

Functional analysis involving expression of various 
Gemin5 polypeptides in mammalian cells with reduced 
amounts of the endogenous protein provided insights into 
the mechanism mediating translation control. The repres- 
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sor effect of Gemin5 on IRES-dependent translation pre- 
cisely maps to residues 1383-1508, corresponding to RBS2. 
In contrast, overexpression of the G5 1297-1412 or G5s45-i436 
fragments, containing RBS1, did not appreciably affect 
translation efficiency, whereas expression of G5s45-i508 and 
G5 1287-1508 stimulated translation slightly. These differential 
effects on translation are summarized in a model (Figure 
8) that illustrates the mechanism deciphered in this study. 
Gemin5 is proteolyzed during FMDV infection generating 
at least two fragments, p85 and p57 (25), corresponding to 
G5845-1508 and G5s45-i436, respectively. Proteolysis was ob- 
served at similar times as PABP or PTB, and later than 
eIF4G, presumably contributing to inhibit cellular gene ex- 
pression by eIF4G -independent mechanisms. Given the fact 
that the amounts of p85 and p57 in infected cells were 
differentially accumulated (25), we investigated the poten- 
tial role of each of these products in translation control. 
Remarkably, while the full-length protein down-regulates 
translation, G845-1436 corresponding to the product accu- 
mulated in infected cells does not impair IRES-dependent 
translation. Instead, the short peptide G5i383-i508 that was 
undetected in infected cells, consistent with the observa- 
tion that its expression declined over time (Figure 3), re- 
presses translation. In contrast, its counter part G5 1297-1412 
does not affect translation efficiency, as it also happens with 
G5845-1436- Thus, we would like to conclude that the re- 
pressor polypeptide G5 1435-1508 produced in infected cells 
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is not stable while the stable products (p85, p57) do not 
interfere IRES-dependent translation. Indeed, G5 8 45_i508 
slightly stimulates translation. The close correlation be- 
tween the effect on IRES-dependent translation reported 
in this study and the amounts of proteolysis products ac- 
cumulated in infected cells is consistent with the hypothe- 
sis that conformational changes of the protein induced by 
the differential RNA-binding capacity of RBS1 and RBS2 
and/or their capacity to interact with other factors are at 
the basis of the role of Gemin5 on translation control (Fig- 
ure 8). Conversely, changes in RNA structure induced by 
the binding of the IRES to RBS1 or RBS2 depending upon 
the Gemin5 polypeptides could also be compatible with the 
differential effect on translation. Further research to clar- 
ify the conformations adopted by these domains might con- 
tribute to illuminate the crosstalk between the elements in 
the bipartite RNA-binding site of Gemin5. 

In addition to these properties, Gemin5 was described 
as a scaffolding protein with the capacity to interact with 
eIF4E (48) as well as m 7 GTP-resin (49), two features that 
link Gemin5 with translation events. Gemin5 that is found 
in the cell cytoplasm (50) was recently included in the 
mRNA interactome catalogue (51), congruent with a mul- 
tifunctional role of this protein. It is likely that Gemin5 
may recruit (or interfere with) other factors that also have 
mRNA-binding capacity and thus regulate translation. Im- 
portantly, we have discovered that Gemin5 directly binds 
to the IRES through a specific stem loop allowing a large 
degree of RNA sequence flexibility (24). The RNA-binding 
capacity of this region of Gemin5 differs from that reported 
to recognize the sm site of snRNAs (10). This difference 
suggests the existence of several RNA targets recognized 
by specialized domains likely assembled in distinct func- 
tional complexes. Further studies will be necessary to iden- 
tify novel targets of Gemin5 and understand the possible 
role of this protein on the expression of other mRNAs 
within the cell. 
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